A Separability Framework for Analyzing Community Structure

Authors

  • BRUNO ABRAHAO
  • SUCHETA SOUNDARAJAN
  • JOHN HOPCROFT
  • ROBERT KLEINBERG
Abstract

Four major factors govern the intricacies of community extraction in networks: (1) the literature offers a multitude of disparate community detection algorithms whose output exhibits high structural variability across the collection, (2) communities identified by algorithms may differ structurally from real communities that arise in practice, (3) there is no consensus characterizing how to discriminate communities from non-communities, and (4) the application domain includes a wide variety of networks of fundamentally different natures. In this paper, we present a class separability framework to tackle these challenges through a comprehensive analysis of community properties. Our approach enables the assessment of the structural dissimilarity among the output of multiple community detection algorithms and between the output of algorithms and communities that arise in practice. In addition, our method provides us with a way to organize the vast collection of community detection algorithms by grouping those that behave similarly. Finally, we identify the most discriminative graph-theoretical properties of community signature and the small subset of properties that account for most of the biases of the different community detection algorithms. We illustrate our approach with an experimental analysis, which reveals nuances of the structure of real and extracted communities. In our experiments, we furnish our framework with the output of ten different community detection procedures, representative of categories of popular algorithms available in the literature, applied to a diverse collection of large-scale real network datasets whose domains span biology, on-line shopping, and social systems. We also analyze communities identified by annotations that accompany the data, which reflect exemplar communities in various domains. We characterize these communities using a broad spectrum of community properties to produce the different structural classes.
As our experiments show that community structure is not a universal concept, our framework enables an informed choice of the most suitable community detection method for identifying communities of a specific type in a given network and allows for a comparison of existing community detection algorithms while guiding the design of new ones.
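
The idea of describing communities by structural properties and then asking how well those properties separate two classes of communities can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy graph, the two hypothetical classes of node sets, the two properties (`internal_density` and `cut_fraction`), and the use of a per-feature Fisher criterion as the separability score.

```python
# Illustrative sketch only: the graph, the classes, the two properties, and the
# Fisher criterion are assumptions, not the paper's actual pipeline.

def internal_density(edges, nodes):
    """Fraction of possible node pairs inside the set that are connected."""
    nodes = set(nodes)
    n = len(nodes)
    if n < 2:
        return 0.0
    internal = sum(1 for u, v in edges if u in nodes and v in nodes)
    return internal / (n * (n - 1) / 2)

def cut_fraction(edges, nodes):
    """Boundary edges divided by the volume (sum of degrees) of the set.

    This equals conductance when the set holds at most half of the graph's
    total volume.
    """
    nodes = set(nodes)
    cut = sum(1 for u, v in edges if (u in nodes) != (v in nodes))
    internal = sum(1 for u, v in edges if u in nodes and v in nodes)
    volume = 2 * internal + cut
    return cut / volume if volume else 0.0

def features(edges, nodes):
    """Feature vector describing one community."""
    return (internal_density(edges, nodes), cut_fraction(edges, nodes))

def fisher_score(class_a, class_b, idx):
    """Per-feature Fisher criterion: squared mean gap over summed variances."""
    a = [f[idx] for f in class_a]
    b = [f[idx] for f in class_b]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    var_a = sum((x - mean_a) ** 2 for x in a) / len(a)
    var_b = sum((x - mean_b) ** 2 for x in b) / len(b)
    denom = var_a + var_b
    return (mean_a - mean_b) ** 2 / denom if denom else float("inf")

# Toy undirected graph: two triangles joined by the single edge (2, 3).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]

# Hypothetical class A: node sets that match the two triangles.
class_a = [features(edges, s) for s in ({0, 1, 2}, {3, 4, 5})]
# Hypothetical class B: node sets that straddle the bridge.
class_b = [features(edges, s) for s in ({1, 2, 3}, {2, 3})]

for idx, name in enumerate(("internal_density", "cut_fraction")):
    print(name, fisher_score(class_a, class_b, idx))
```

On this toy input the boundary-based property separates the two classes far more strongly than density does, which is the kind of per-property comparison a separability analysis is meant to surface. A realistic study would use many more properties and communities, and a multivariate criterion in place of this per-feature score.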

Similar resources

Gender and community development in Iran with emphasize in driving force the case study: Nemat Abad Tehran

One of the UN Millennium Development Goals is women's participation in urban management. This article develops a theoretical framework for analyzing the relationship between community-based planning and women's participation in cities. To this end, it draws on collective action, social capital, and the neighborhood as a location for community planning. The framework identifies a series of variables th...

Determination of Spatial-Temporal Correlation Structure of Troposphere Ozone Data in Tehran City

Spatial-temporal modeling of air pollutants, ground-level ozone concentrations in particular, has attracted recent attention because it makes it possible to analyze, interpolate, or predict ozone levels at any location. In this paper we consider daily averages of tropospheric ozone over Tehran city. To remove the trend in the data, a dynamic linear model is used, then some featu...

Indicators Developed to Evaluate the International Framework Convention on Tobacco Control in Iran; A Grounded Theory Study

This study aimed to develop indicators for evaluating the implementation of the Framework Convention on Tobacco Control (FCTC) in Iran. We used the “grounded theory” framework. In total, 265 policy-makers, stakeholders, and community members were recruited by purposeful sampling in 2008. After analyzing the gathered data, 251 indicators, including 82 indicators as “applied indicators”, were deri...

Evaluation of Tests for Separability and Symmetry of Spatio-temporal Covariance Function

In recent years, several investigations have examined assumptions such as stationarity, symmetry, and separability of the spatio-temporal covariance function, which considerably simplify fitting a valid covariance model to the data by parametric and nonparametric methods. In this article, assuming a Gaussian random field, we consider the likelihood ratio separability test, a va...

Dimensionality Reduction of Hyperspectral Data to Increase Class Separability and Preserve Data Structure

By gathering hundreds of spectral bands from the surface of the Earth, hyperspectral imaging allows us to separate materials with similar spectra. Hyperspectral images can be used in many applications such as estimation of chemical and physical land parameters, classification, target detection, unmixing, and so on. Among these applications, classification is of particular interest. A hyperspectral im...

Publication date: 2013